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(54) Data processing circuH, microcpmputer, and electronic equipment 



(57) The data processing circuit of this invention 
enables efficient description and execution of processes 
that act upon the stack pointer, using short instructions. 
It also enables efficient description of processes that 
save and restore the contents of registers, increasing 
the speed of processing of intenrupts and subroutine 
calls and returns. A CPU that uses this data processing 
circuit comprises a dedicated stack painter register SP 
and uses an instruction decoder to decode a group of 
dedicated stack pointer instructions that specify the SP 
as an Implicit operand. This group of dedicated stack 
pointer instructions are implemented in hardware by 
using general-purpose registers, the PC. the SP. an 
address adder, an ALU. a PC incrementer. internal 
buses, internal signal lines, and external buses. This 
group of dedicated stack pointer instructions comprises 
SP-relative load instructions, stack pointer move 
instructtons, a call instruction, a ret instruction, a 
sequential push instruction, and a sequential pop 
instruction. 



FIG J 



\FIGJA \FIGJB\ 




Q. 

UJ 



Primed by Rank Xerox (UK) Business Services 
2.14.2304 



EP0809180A2 
FIG. IB 



^ I General-Purpose 



Rta 



registers 



52 




Zero 
detector 



Memory 
(ROM) 



Memory 
(RAM) J4 

50 



ecu 



60 



JO CPU 

' (esecuKon means) 



2 



EP0 809180A2 



Description 

Background of the Invention 
5 Reld of the invention 

This invention relates to a data processing drcurt. a microconputer incorporating this data processing circuit, and 
electronic equipment that is constructed by using this microcomputer. 

10 Description of Related Art 

In the prior art a RISC microcomputer that is capak)le of manipulating 32-bit data uses fixed-length instructions that 
are 32 bits wide. This is because the use of f ixed-length instructions enables a reduction in the time required for decod- 
ing the instructions in comparison with the use of variable-length instructions, and it also makes it possible to reduce 
IS the size of the circuitry of the microcomputer. 

However, even with a 32-bit microcomputer, it isn't always necessary to use the full 32 bits. Thus, if alt of the instruc- 
tions are written as 32-bit instructions, many of them will contain unused portions so that menfx>ry will be used ineffi- 
ciently 

The present inventors have investigated tiie idea of a microcomputer that handles fixed-length instnjctions of a bit 
20 width that is narrower than tiiat of the executable data, to improve the usage efficiency without making tiie confrol cir- 
cuitry more complicated. 

However, simply cutting 32-bit fixed-length instructions to fit into a fixed 1 6-bit length, for example, causes problems 
as described below. 

An important feature of a RISC microcomputer is the use of general-purpose registers, whose contents have no 
25 pre-assigned meaning. For tiiat reason, when a stack pointer is used, one of the general-purpose reigister is used as 
the stack pointer and an instruction that acts upon a general-purpose register is used to implement stack operation. 

When coding a process tiiat takes data specified by a memory address obtained by adding a predetermined offset 
to tiie stack pointer and transfers it to a predetermined register, using an instruction tiiat acts upon a general-purpose 
register, for exanple. it is necessary to include within the object code of that instruction tiie offset data specifying tiie 
30 predetermined register, and data specifying tiie register to be used as the stack pointer. 

Thus, if an instruction that acts upon a general-purpose register is used when coding a process that acts upon tiie 
stack pointer, a large amount of data has to be specified in the object code, making it difficult to code the details of the 
instruction wittiin a 16-bit fixed-lengtii. Increasing tiie instruction length to 32 bits, for example, wouti result in a large 
number of instructions that do not particularly need tiie entire 32 bits, so that many instructions will contain unused por- 
35 tions, leading to a deaease in the efficiency wrtii which memory is used. 

In addition, increasing the instruction lengtii will also increase the amount of memory required for storing such 
instructions, so it is preferable from tiie point of view of efficient use of memory to not only use fixed-length instructions, 
but also make the instructions as short as possible. 

When a program is executed that was written in a language that secures a storage region for auto-variat}les finked 
40 to the stack pointer, such as programming language C, there are many instructions tiiat act upon tiie stack pointer, so 
it is preferable to have efficient coding and execution of instructions that act upon the stack pointer. 

In tiiat case, it Is tiierefae preferafcrfe to have an architecture that enables the coding and execution of instructions 
that are as short as possible, when executing a process tiiat acts upon the stack pointer. 

It has become common recentiy to incorporate general-purpose registers internally, particularly with a RISC CPU, 
45 to increase capabilities. Providing a large number of registers internally makes it possible for many processes to be exe- 
cuted rapidly within the CPU itself, witiiout having to access memory If such a targe number of internal registers is pro- 
vided, a large number of registers have to be saved during the process of register save and restoration when an 
interrupt is processed or a subroutine is called. 

The description now turns to a prior-art example of instructions tiiat save and restore the contents of registers, fre- 
50 quentiy used when entering and leaving a subroutine, even within stack-related instructions. 

The instruction set of a microoomputer usually has instaictions for saving and restoring tiie contents of registers in 
the CPU to and from a stack provkied in memory. These are either dedicated instructions or instiiictions tiiat address 
reqisters indirectiy 

In tiie Intel 80386 chip. push, pusha, and pushad instructions are available for writing the contents of registers to 
55 the stack, and pop. popa. and popad instructions are available for returning data from tiie stack to registers. 

When tiie contents of a register are written to the stack by tiie push instructk>n, tiie register has to be specified as 
an operand, such as 'push EAX/ This example concerns a 32-bit register EAX. If tiie contents of registers EAX, ECX, 
EDX. and EBX are all to be written to the stack, this push instruction must be repeated, as follows: 
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pushEAX 
push ECX 
pushEDX 
pushEBX 

5 

If such push and pop instructions are used for each of the registers, the size of the object code increases and a 
large number of program execution steps is required, leading to delays in the execution time or processing of the pro- 
gram. 

That is why the pusha or pushed instruction is used to write the contents of all eight of the general-purpose regis- 
10 ters of the 80386 to the stack, pusha operates on the lower 16 bits of each of the eight registers, and pushad operates 
on the entire 32 bits. The use of pusha and pushad can remove the need to repeat the push instruction eight times. 
The pop, popa, and popad instructions act in a similar manner. 

Disadvantages of repeating the push instruction include the increased length of program code and the delays in 
execution caused by executing a fetch for each instruction. From this point of view, the pusha and pushad instructions 
75 provide a large improvement when writing the contents of all eight registers to the stack. However, this Is not an advan- 
tage when writing fewer than the eight registers, such as four or six registers. 

In other words, if the pusha, pushad. popa. arxj popad instructions of the 80386 operate upon alt of the registers, 
instructions with delayed cycle lengths must be used when there is no need to save and restore the contents of all of 
the registers. In such a case, one instruction will suffice, but this gives rise to a problem in that the execution cycle of 
20 that instruction takes too long. 

Similar saving and restoration of the program counter is necessary when program flow branches to a subroutine 
and returns from the called routine, using instructions such as the call instruction and ret instruction. With a RISC CPU 
of the prior art, these processes are implemented by software. In other words, the program counter is saved and 
restored by executing assembler instructions (object code) that declare these processes. This leads to an increase in 
25 object code of the call and ret instructions, and the execution of a fetch for each instruction makes ttie execution speed 
sluggish. 

SMmmary of the Invention 

30 An objective of this invention is to provide a data processing circuit, microcomputer, and electronic equipment hav- 
ing an architecture that enables efficient coding of processes tiiat act upon the stack pointer within a short instruction 
length, for execution. 

Another objective of this invention is to provide a data processing circuit, microcomputer, and eledronic equipment 
that enables efficient coding of processes that save and restore the contents of registers, witii fast processing of Inter- 
35 rupts and subroutine calls and returns. 

In order to achieve the above objectives, a first aspect of this invention provides a data processing circuit compris- 
ing: 

a dedicated stack pointer register that is used only for the stack pointer; 
40 decoding means for decoding object code of a group of dedicated stack pointer instructions which have object code 
specifying the dedicated stack pointer register as an inplicit operand and which relate to processing based on the 
dedicated stack pointer register, and for outputting a control signal based on the object code: and 
execution means for executing the group of dedicated stack pointer instructions based on the control signal and the 
contents of the dedicated stack pointer register. 

45 

In this document, "object code" generally refers to program code obtained as a result of translation into machine 
language by a translation program. With reference to the present invention, it is used in a broad sense to comprise all 
program code written in machine language, irrespective of whetiier ttiis is done by a translation program. 

The data processing circuit of this invention is configured that a group of dedicated stack pointer instructions having 
so a dedicated stack pointer register that is used only for the stack pointer and acting upon the dedicated stack pointer reg- 
ister are decoded and executed. 

Since this group of dedicated stack pointer Instructions has dedicated op code that specifies manipulation of the 
dedicated stack pointer register, it is not necessary to specify the stack pointer in the operands of the ohiect coda That . 
is to say. this group of dedicated stack pointer instructions uses the dedicated stack pointer register as an implfoit oper- 
55 and. This makes it possible to write shorter instructions for manipulating tiie stack pointer, in comparison with an stack 
pointer manipulation in which one of the general-purpose registers is allocated to the stack pointer and instructions that 
act upon that general-purpose register are used. 

The present invention therefore makes it possible to provide a data processing circuit tiiat is capable of coding and 
executing shorter instructions for processes tiiat act upon the stack pointer. It also enables ttie provision of a data 
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processing circuit that makes efficient use of the memory. 

In a second aspect of this invention, the group of dedicated stad^ pointer instructions comprises a load instruction 
having data for specifying a transfer register within object code; 

5 the decoding means decodes the load instruction; and 

the execution means performs at least one of a transfer of data from a given first area in memory to a given first 
register and a transfer of data from the given first register to the given first area, based on a memory address spec* 
if ied by the dedicated stack pointer register and a register address specified by the data for specifying a transfer 
register, when the load instruction is executed. 

10 

In this case, the load instruction comprised within the group of dedicated stack pointer instructions is an instruction 
for transferring data between memory and a register, which means at least one of a transfer from memory to the register 
and a transfer from the register to memory. Note that this concept also comprises address data, irrespective of the con- 
tents of the data. A memory address Is an address for specifying an area in memory during data transfer. 

15 Since this load instruction has dedicated op code that specifies an operation relating to the dedicated stack pointer 
register, it is not necessary to have data specifying the stack pointer in the operands of the object code. It is therefore 
possible to code within a short instruction length when performing transfers of data between a register and an area in 
memory having a memory address linked to the stack pointer 

In a third aspect of this invention, the load instiuction comprises offset data within the object code that is data relat- 

20 ing to an offset for specifying the address of the first area in the memory; and 

the execution means determines the memory address aocordng to the contents of the dedicated stack pointer reg- 
ister and the offset data. 

25 In this case, offset data could be an offset that is specified directly as immediate data, or it could be specified indi- 
rectly by means such as specifying the address of a register or the like containing the offset When this load instruction 
comprising offset data Is executed, the memory address necessary for the data transfer is determined on the basis of 
the contents of the dedicated stack pointer register and the offset data. 

This means that instructions can be coded within a short length, for the transfer of data between a register and an 

30 area in memory having a memory address that is determined on the basis of the stack pointer and the offset data. 

This invention makes it possible to specify any area in the stack, by specifying appropriate offset data, even with a 
data processing circuit that is configured in such a manner that the stack pointer always Indicates a word boundary. 
Thus the data can be stored In the stack efficiently according to size, enabling an increase in the usage eff idency of the 
stack. 

35 In a fourth aspect of this invention, the offset data comprises immediate offset data and data size information relat- 
ing to the size of given data in memory; and 

the executk)n means creates an offset by performing a leftward logical shift on the immediate offset data, based on 
the immediate offset data and the data size information, and determines the memory address by adding the offset 
40 to the contents of the dedicated stack pointer register. 

In this case, the immediate offset data is an offset ttiat is specified directiy by immediate data. The data size infor- 
mation is tiie size of data in memory to be transfen-ed. The data size is usually represented as 2" (where n is greater 
than 3). such as 8-bit byte data. 16-bit half-word data, or 32-bit word data. Addresses In memory are given in byte units. 
45 so tiiat half-word data is placed at half-word boundaries and word data is placed at word tx)undaries. This means that 
the least significant bit of the menxjry address of half-word data is 0 and the least significant two bits of the memory 
address of word data are 00. Since tiie address of the stack pointer indicates a word boundary, the least significant bit 
of the offset is 0 when aeating a memory address for half-word data and the least significant two bits of the offset are 
00 when aeating a memory address for word data. 
50 A leftward logical shift means moving tiie bit string of tiie data to the left, and inserting zeros Into the bits on the 
right-hand end of tfie data ttiat are made vacant by the shift (shift-in bits). 

This aspect of tiie invention makes it possible to code immediate offset data witiiout the lower bits that are deter- 

mined unannbiauously by ttie data size, because tiie immediate offset data is shifted to ttie left in accordance with tiie 

data size. Tnis means that tiie Immediate offset data can be speof led eff Idently. and larger offsets can be specified than 
55 in cases in which the data size is specified as Is. even if tiie data size is not bytes. 

The use of these instructions makes it possible to select boundary locations tiiat are matched to the size of data 
when that data is written to memory or read therefrom. 

In a fifth aspect of tiiis invention, the group of dedicated stack pointer instructions comprises a stack pointer move 
Instruction for having move data within ttie object code thereof and for moving the stack pointer; 
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the decoding means decodes the stack pointer move Instruction; and 

the execution means modifies the contents of the dedicated stack pointer register based on the move data when 
the stack pointer move instruction is executed. 

5 Since this stack pointer move instruction has dedicated op code that specifies manipulation of the dedicated stack 
pointer register, data for spedfyrng the stack pointer in the operands of the object code is not necessary. This makes it 
possible to code within a short instruction length, when it is desired to move the stack. Thus the amount of instruction 
coding can be reduced for processes involving data stored in the stack and data that is stored linked to the stack pointer. 
Since this aspect of the invention makes it possil^e to move the stack in a simple manner, it is particularly effective 
10 for processes that secure different stack areas for different individual routines. In other words, addresses can be spec- 
ified over a wkie region by moving the stack pointer for each routine, as appropriate. 

In a sixth aspect of this invention, the move data comprises immediate data for the stuck pointer move instruction: 

and 

75 the instruction execution means performs at least one of a process of adding the Immediate data for the stuck 
pointer move instruction and the contents of the dedicated stack pointer register and a process of subtracting the 
immediate data for the stuck pointer move instruction from the contents of the dedicated stack pointer register. 

This aspect of the invention makes it possible to code shorter instruction length for processes that move the stack 
20 pointer upward or downward by an amount specified by the immediate data for the stuck pointer move Instruction. 
In a severrth aspect of this invention, the data processing circuit further comprises: 

a plurality of registers provkled in a contiguous sequence: 

wherein the group of dedicated stack pointer Instructions comprises at least one of a sequential push 
25 instruction and a sequential pop Instruction having data for specifying a plurality of registers in tiie object code 
thereof: 

the decoding means decodes at least one of the sequential push instruction and the sequential pop instruction; and 
the instruction execution means performs at least one of a process of executing a plurality of sequential pushes of 
data from the plurality of registers to a stack provided in memory and a process of executing a plurality of sequential 
30 pops of data from the stack to tiie plurality of registers, based on the contents of a memory address specified by 
the dedicated stack pointer register and the data for specifying a plurality of registers, during the execution of at 
least one of the sequential push instruction and the sequential pop instruction. 

A push means the storage of data at the top of a stack provkled in memory, and a pop means the fetching of data 
35 from this stack. In this case, each of tiie push processing and pop processing conprlses this process of storing or fetch- 
ing data and tiie corresponding updating of tiie stack pointer. An ordinary data processing circuit has a push instruction 
for storing data or an address from one register to the stack, and a pop instruction for fetching tiie contents of the stack 
to a register. Each push and pop instruction transfers data between the register and the stack and updates the stack 
pointer in correspondence witti this transferring. , 
40 This means tiiat Is it necessary to execute these instructions repeatedly when transfeiring data between a plurality 
of registers and the stack 

However, the execution of the sequential push instruction or sequential pop instruction of this aspect of tiie Inven- 
tion provides the same effect as executing a plurality of push instructions or a plurality of pop instructions in sequence. 
In other words, it is possible to execute a single instruction to move data between a plurality of registers and the stack. 
45 and also update the stack pointer con^espondingly. This makes it possible to avoid the Increase in size of the object code 
caused by repeated execution of the push or pop instruction, when transferring data between a plurality of registers and 
the stack. This also enables an increase in the speed of processing interrupts and subroutine calls and returns, without 
Increasing tfie number of execution steps in the program and without any wasted cycles. 
The d9ta processing circut of an eighth aspect of this invention further conprlses: 

so 

n general-purpose registers specif led by register numbers 0 to n-1 : 

wherein object code of at least one of the sequential push instruction and the sequential pop instruction 

. comprlses.a final register number to which one of the register numbers is specified, as the data for spedfyino a piu-. . 

rality of registers; and " 

55 the execution means performs at least one of a process of executing a plurality of sequential pushes of data into a 
stack provkled in memory from a plurality of registers starting from register 0 to a register specified by tiie final reg- 
ister number and a process of executing a plijrality of sequential pops of data from ttie stack to a plurality of regis- 
ters starting from register 0 to a register specified by the final register number, based on the contents of a memory 
address specif led by the dedicated stack pointer register. 
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When there are plurality of general-purpose registers, there are usually addresses for specifying these registers. 
This invention specifies these registers by register number, in continuous sequence from 0 to n-1. 

This aspect of the invention makes it possible to execute at least one of a push or pop of data between memory 
and a plurality of continuous registers from register 0 to the register with a final register number, by specifying any reg- 
ister number as the final register wrrber. Therefore, the contents of registers can be saved and restaed efficiently dur- 
ing the execution of a program having a structure that uses registers in sequence from register number 0. 

In a ninth aspect of this invention, the execution means comprises: 

write means for writing the contents of a given register that is one of the plurality of registers to a stack provided in 
memory, based on a memory address specified by the dedicated stack pointer register: 
number-of-times-written count means for counting the number-of-times-written of the write means; and 
comparison means for comparing the number-of-times-written counted by the count means with the value of the 
data for specifying a plurality of registers; 

wherein the write means comprises: 
write memory address generation means for adding a first input and a second input by an adder, and for generating 
a write memory address for specifying a write destination; 

first input control means for providing control such tiiat the first input of the adder is the contents of the dedicated 
stack pointer register at the start of execution of a sequential dedicated instruction, and is a write address gener- 
ated subsequentiy by a write address generation means; 

second input control means for outputting an offset used during the writing of one word from the stack to tiie second 
input of the adder; and 

write means for writing to the stack the contents of a register specified by subtraction processing which uses the 
data for specifying a plurality of registers and the number-of-times-written. based on the write memory address; 

whereby the writing of the contents of the plurality of registers to the stack and the ending of the writing are 
controlled based on the comparison result of the comparison means. 

In a tenth aspect of this lnventk>n, the Instruction execution means comprises: 

read means for reading the contents of a stack provided in menfx)ry teased on a memory address specified by the 
dedicated stack pointer register, and storing ttie contents in a given register of the plurality of registers; 
number-of-times-read count means for counting the number-of-times-read of the read means; and 
comparison means for comparing the number-of-times-read counted by the count means with the value of the data 
for specifying a plurality of registers; 

wherein the read means comprises: 
write memory address generation means for adding a first input and a second ir^ut i3y an adder, and for generating 
a write memory address for specifying a write destination; 

first Input control means for providing control such that the first input of the adder is the contents of the dedk^ated 
stack pointer register at the start of execution of a sequential dedicated instruction, and is a read address gener- 
ated subsequentiy by a read address generation means; 

second input control means for outputting an offset used during the writing of one word from tiie stack to the second 
input of the adder; 

read means for reading the contents of tiie stack based on the read memory address and storing the contents In a 
register specified based on ttie number-of-times-written; 

whereby the reading of the contents of the stack and the ending of ttie reading are controlled based on ttie 
comparison result of the comparison means. 

This configuration makes it possible to implement tiie saving to the stack of the contents of a plurality of registers 
specified by linked sequential values and the restoration of data to a plurality of registers specified by linked sequential 
values, using only a count means and a simple sequence controller. Since ttiis makes It possible to provide a data 
processing circuit with a small number of gates, tiiis invention can be applied to one-chip microcomputers or the like. 

An eleventh aspect of this invention further comprises: 

. a program counter register used only for tiie program counter; 

wherein the group of dedicated stack pointer instructions comprises branch Instructions that are an instruc- 
tion for branching to a subroutine and a return instruction from the subroutine; 
tiie decoding mear^ decodes the branch insta^uctions; 
tiie instruction execution means comprises: 

means for executing at least one of a process of saving the contents of ttie program counter register to a given 
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second area of the stack provided in memory and a process of restoring the contents of the second area to the 
program counter register, based on a memory address specified by the dedicated stack pointer register, during 
the execution of the branch instruction; and 

means for updating the contents of the dedicated stack pointer register based on the saving and restoration. 

5 

In this case, a subroutine could be such as an interrupt processing, exception processing, or debugging processing 
routine. Therefore, an instruction for branching to a subroutine could be such as an instruction that calls a subroutine 
or the like, or a software interrupt instruction or software debugging interrupt instruction for branching to an interrupt 
processing, exception processing, or debugging processing routine. Similarly, a return instruction from the subroutine 
10 could be a return instruction such as from an inten'upt processing, exception processing, and debugging processing 
routine. 

It is usually necessary to save and restore the program counter when t)ranching to a subroutine or returning from 
a subroutine. 

This invention makes it possible to save and restore the program counter simultaneously with the execution of the 
IS instruction for branching to the subroutine and the instruction for returning from ttie subroutine. In other words, the data 
processing circuit of this aspect of tiie invention has a circuit structure that enables the saving and restoration of the 
program counter by either one of Instruction, within an instruction for branching to a subroutine and a return instruction 
from the subroutine. There is therefore no need for Instructions for the saving and restoration of the program counter, 
which is necessary when branching to a sutvoutine and returning therefrom, enabling a reduction in the number of 
20 instructions. This makes It possible to increase the speed of processing during a branch to another routine, such as 
subroutine call and return, witiiout wasting cycles. 

When a software interrupt Instruction is generated, for example, it is necessary to save and restore the processor 
status register that hoUs the current state of the CPU or other data processing circuit It is therefore preferable to per* 
form tiirs saving and restoration of the processor status register simultaneously witii the execution of an Instruction such 
25 as a software interrupt instruction. 

A twelftii aspect of this invention relates to a data processing circuit comprising a plurality of registers provided in 
a contiguous sequence and a stack pointer allocated to one of a plurality of general-purpose registers, the data 
processing circuit further comprise: 

30 means for decoding object code of an instruction that Is at least one of a sequential push instruction and sequential 
pop instruction that each have data for specifying a plurality of registers within object code and for outputting a con- 
trol signal on the basis of the object code; and 

means for performing at least one of a process of executing a plurality of sequential pushes of data from the plural- 
ity of registers to a stack provided in memory and a process of executing a plurality of sequential pops of data from 
3S the stack to the plurality of registers, based on the control signal, the contents of a memory address specified by 
the dedicated stack pointer register, and the data for specifying a plurality of registers, during ttie execution of at 
least one of the sequential push Instruction and the sequential pop Instruction. 

This aspect of the invention relates to a sequential push instruction and a sequential pop instruction that are used 
40 when a general-purpose register Is used as the stack pointer. 

The execub'on of this sequential push instruction or sequential pop Instruction has the same effect as the execution 
of a plurality of push instructions in sequence or ttie execution of a plurality of pop instructions in sequence. In other 
words, the movement of data between a plurality of registers and the stack and the corresponding updating of the stack 
pointer are enabled by the execution of single instructions. This makes it possible to avoid ttie increase in object code 
45 size caused by repeated executions of push or pop instructions, when transferring data between a plurality of registers 
and the stack. This also enables an increase in ttie speed of processing interrupts and subroutine calls and returns, 
without increasing the number of execution steps in ttie program and without any wasted cycles 

In a ttilrteenth aspect of this invention, this data processing circuit uses instructions of a RISC. 

A RISC data processing drcuit is designed with the objectives of reducing the dimensions of the hardware and 
so increasing the speed thereof. Thus, a RISC data circuit has many general-purpose registers. Reduction of ttie nunrber 
of instructions is achieved by using only an Instruction set which has universality! 

Therefore, with a RISC data processing circuit, the stack pointer is allocated to a general-purpose register, and tiie 
processing uses an instruction set that acts upon this general-purpose register when manipulating tiie stack pointer. 
However, this method leads to an increased instruction length, so that memory is used ineffidentiy. 
55 This aspect of the invention makes it possible to increase the efficiency with which memory is used by a RISC data 
processing circuit by reducing the Instruction lengtti. 

In the data processing circuit of a fourteenth aspect of this invention, a fixed-length Instruction is decoded and exe- 
cution is based on ttiat instruction. 

When the use of fixed-length instructions Is compared with the use of variable-length Instructions, it Is possible to 
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shorten time required for decoding the instructions and reduce the dimensions of the data processing circuit itself. To 
ensure that memory Is used efficiently when fixed-length instructions are employed, by avoiding the creation of unused 
portions within instruction, it is preferable to have little variation in the number of bits necessary for each instruction, so 
that they are as short as possible. 

5 This aspect of the invention makes it possible to reduce the length of instructions that act upon the stack pointer 
arxj thus tend to become too long. It is therefore possible to avoid the aeation of unused portions within instructions, 
even when fixed-length instructions are employed, so that memory can be used more efficiently. 

The microcomputer of a fifteenth aspect of this invention comprises the previously described data processing cir- 
cuit of this invention, storage means, and input and output means for inputting data from and for outputting data to exter- 

10 nal devices. 

This aspect of the invention makes it possible to provide a microcomputer with a fast processing speed and efficient 
usage of memory. 

In the microcomputer of a sixteenth aspect of this invention, a program tiiat is executed thereby uses a program 
language that secures a storage region for auto-variables by using the stack pointer. 
15 An example of a language that secures a storage region for auto-variat)les by using tiie stack pointer is C language. 
When the microcomputer of this invention executes a program written in such a language, the processing speed and 
memory usage efficiency can be increased effectively. 

Electronic equipment in accordance with a seventeenth aspect of this invention conprises one of the atx3ve 
desaibed microcomputers of this invention. 
20 This aspect of the invention makes it possible to provide inexpensive, but sophisticated, electronic equipment ttiat 
is facilitated by a data processing circuit tiiat has a fast processing speed and a highly efficient usage of memory. 

Brief Description of the brawiiiffs 

25 Fig. 1 is a schematic view of tiie circuit structure of a CPU in accordance with a first emkxxiiment of this invention: 
Fig. 2 shows a register set of the CPU of this embodiment; 
Rg. 3 is a view illustrative of the basic operation of the stack pointer; 

Fig. 4 shows an example of the usage of a dedicated stack pointer instruction, as well as the usage state of tiie 

stack provided in memory and the state of the stack pointer; 
30 Rg. 5 is a view illustrative of the process of transfer-ring auto-variat)les in memory to registers; 

Rgs. 6A and 6B show the bit fields of an SP-relative load instruction and a general-purpose load instruction; 

Fig. 7 is a flowchart illustrative of the operation during the SP-relative load instruction for reading word data; 

Rg. 8 is a ffowchart illustrative of the operation during the SP-relative load instiruction for writing word data; 

Rgs. 9A to 9F show views illustrative of the usage state of the stack in memory and the state of the stack pointer 
35 during each routine of an executing program that extends over a plurality of routines; 

Rgs. 10A and 10B show the bit fields of a stack pointer move instruction and a general immediate data arithmetic 

insti'uction; 

Rg. 11 is a flowchart illustrative of the operation during tiie add stack pointer move instruction; . 

Rg. 12 is a flowchart illustrative of the operation during the subtract stack pointer move instruction; 
40 Fig. 13 is a view illustrative of program execution control during the call and ret instructions; 

Rg. 14 shows the bits field of an PC-relative subroutine call instruction; 

Rg. 15 is a flowchart illustrative of the operation during the PC-relative subroutine call instruction; 

Rg. 1 6 is a flowchart Illustrative of the operation during the ret instruction; 

Rgs. 17A and 17B schematically show the actions during the execution of push instructions; 
45 Rgs. 1 8 A and 188 schematically show the actions during the execution of pop instructfons; 

Rg. 1 9 shows the bit sti'ucture of the pushn and popn instructions; 

Rg. 20 is a block diagram of the hardware configuration required for executing tiie sequential push instruction 
(pushn) or sequential pop instruction (popn); 

Rg. 21 is a flowchart illustrative of the operation during tiie pushn instructfon; 
so Rg. 22 is a flowchart illustrative of tiie operation during tiie popn instructfon; and 

Rg. 23 Is a block diagram of the hardware of a microcomputer iri accordance with a second embodiment of this 
invention. 

Description of Preferred Embodiments 

55 

Embodiments of this invention are described below with reference to the accompanying drawings. 
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Embodiment 1 

(1) Configuration of the CPU of this Embodiment 

5 The CPU of this embodiment executes virtually all of Its instructions within one cycle, thanlcs to its pipeline and 
load/store architecture. All of the instructions are expressed as 1 6-bit fixed-length instructions, so that instructions proc- 
essed by the CPU of this embodiment are implemented within compact object code. 

In particular, this CPU is configured In such a manner that it has a dedicated stack pointer register to enable the 
efficient coding and execution of processes that act upon the stack pointer. It also enables the decoding and execution 
10 of an instruction set of a group of dedicated stack pointer Instructions that have object code specifying that dedicated 
stack pointer register as an implicit operand. 

A schematic view of the circuit structure of the CPU of this embodiment is shown in Rg. 1 by way of illustration. 
This CPU 10 conprises a register set that includes general-purpose registers 1 1 , a PC register 12 for holding the 
program counter, PSR (processor status register) 13. and a dedicated stack pointer register SP 14; an instruction 
IS decoder 20; an immediate data generator 22; an address adder 30; an ALU 40; a PC incrementer 44; various Internal 
buses 72. 74. 76, arid 78; and various internal signal lines 82, 84, 86. and 88. 

The Instruction decoder 20 decodes object code that has been input, performs the processing necessary for the 
execution of the resultant instruction. arxJ outputs any necessary control signals. Note that this instruction decoder 20 
also functions as a decoding means for decoding the object code of the above mentioned dedicated stack pointer 
20 instructbn and outputting control signals on the basts of that instruction. 

The immediate data generator 22 generates 32-bit immediate data that will be used during execution, on the basis 
of immediate data comprised within the object code, and generates the 0. ±1 . i2. and ±4 constant data that is neces- 
sary for the execution of each instruction. The PC incrementer 44 updates the program counter stored in the PC 12. on 
the basis of the execution cydes of the instructions. The address adder 30 adds the data that is stored in the various 
25 registers and the immediate data generated by the immediate data generator 22 to generate the address data required 
during the reading of data from memory The ALU 40 performs numerical calculations and logical operations. 

This CPU also comprises internal bus and signal lines. The functions of a PA_BUS 72 and a PB.BUS 74 Include 
the transfer of input signals for the ALU 40. The functions of a WW_BUS 76 include the fetching of calculation results 
from the ALU 40, to transfer them to the general-purpose registers. The functions of an XA_BUS 78 include the transfer 
30 of address data that has been fetched from the general-purpose registers 1 1 and the SP 14. An lA signal line 82 trans- 
fers address data from the various components within the CPU to an external LADDR.BUS 92. A OA signal line 84 
transfers ^ress data from the various components within the CPU to an external D_ADDR_BUS 96. A DIN signal line 
86 transfers data from an external D_DATA_BUS 98 to the various components within the CPU. A DOUT signal line 88 
transfers data from the various components within the CPU to the external D_DATA_BUS 98. An lA multiplexer 83 
35 switches between signals that are to be output to the lA signal line 82 (signals on the PA_BUS 72, on the WW_BUS 76, 
from the PC 12. and a value that is the PC plus 2). A DOUT multiplexer 89 switches between signals that are to be out- 
put to the DOUT signal line 88 (signals on the PA_BUS 72. on the WVV_BUS 76, from the PC 12, and the value that is 
the PC plus 2). 

Since the components of the CPU 10 execute Instructions on the basis of control signals tiiat are output from tiie 
40 instruction decoder 20. tiiey also function as means for executing a group of dedicated stack pointer instructions on tiie 
basis of the above described control signals and the contents of the dedicated stack pointer register. 

This CPU 10 transfers signals to and from the exterior through a 16-bit instruction data bus (LDATA_BUS) 94. an 
instruction address bus (LADDR.BUS) 92 for instruction data aocess. the 32-bit data bus (D_DATA_BUS) 98. the data 
address bus (D.ADDR.BUS) 96 for data aocess. and a control bus (not shown in the figure) for control signals. 

4S 

(2) Register Set of the CPU of this Embodiment 

Essential portions of the concept of the register set of the CPU of tills embodiment wilt now be described. 
The register set of the CPU of this embodiment is shown in Rg. 2. This CPU has a register set that comprises tiie 
so 16 general-purpose registers 1 1 , the PC 12, the PSR 13, the SP 14. ALR (arithmetic tow register) 15, and AHR (arith- 
metic high register) 16. 

The general-purpose registers 1 1 are functionally equivalent to 32-bit registers and are labeled RO to R15, These 
general-purpose registers 1 1 are used during data calculations and address computations. 

The PC 12 IS a 32-bit-long incremental counter that holds a program counter indicating tiie address of the currentiy 
ss executing instruction. In this document, "PC" is used when referring to the register itself and "program counter" is used 
when referring to the value stored within tiie PC. 

The PC 1 2 cannot be accessed directiy by insti-uctions such as the load instruction. Witfi a call or int instruction, or 
when an interrupt or exception occurs, the program counter is read from tiie PC 12 and saved to the stack. In tiiis man- 
ner, a jump destination address used when a branch Instruction is executed can be set in the PC. This also happens for 
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a branch specified by a conditional branch instruction. The Instruction address for the return destination Is read from the 
stack by a ret or reti instruction and Is replaced In tiie PC12. 

PSR (processor status register) 13 is a 32-bit register to which a flag is allocated, and which holds the current sta- 
tus of the CPU. When an int instruction, an inten-upt. an exception, or the like occurs, the state of the PSR at that point 
5 Is saved to the stack when the flow branches to the corresponding processing routine. Conversely, the execution of the 
reti instruction causes tiie saved value to be replaced In the PSR. 

The SP 1 4 is a 32-bit dedicated stack pointer register containing the stack pointer that Indicates the higher address 
of the stack. In this document, SP is used when refemng to the register itself and stack pointer is used when refemng 
to the value stored within the SP. Note, howe/er. that since tiie stack pointer always points to a word boundary, the low- 
to ermost two bits of the stack pointer are always zero. 

This stack pointer is updated at tiie generation of a trap or at the execution of one of a group of dedicated stack 
pointer instructions that are provided by tills embodiment. 

Examples of these dedicated stack pointer instructions that update the stack pointer include instructions that 
branch to another routine, such as the call and ret insti'uctions, a stack pointer move instruction, a pushn instruction. 
15 and a popn instruction. For instance, when the call instruction is executed, the stack pointer is first decremented by an 
amount equivalent to tiie word size (-4), then the PC12 Is saved to ttie stack. Conversely, when the ret Instruction Is 
executed, the destination address for return Is loaded into tiie PC from tine stacK and tiie stack pointer is Incremented 
by an amount equivalent to ttie word size (+4). When an int instruction is executed, or when an Interrupt or exception 
occurs, tiie value in the PC or PSR is saved to the stack by tiie following procedure: 

20 

1.SP = SP-4 

2. The PC is saved at the higher address of tiie stacK Indicated by tiie stack pointer. 

3. SP = SP - 4 

4. The PSR is saved at the higher address of the stacK indicated by tiie stack pointer. 

25 . 

When tiie reti instruction is executed, tiie above process is reversed to restore the CPU to its original state. Thus 
the execution of tiie call, ret. or int instruction causes tiie stack pointer to be updated as appropriate fbr ttiat execution.' 
Details of each insti'uction will be given later. 

A trap function provided by ttiis embodiment comprises inten-upts ttiat are generated asynchronously with the exe- 
cution^of instructions and exceptions generated by the execution of instructions. When a trap is generated, tiie CPU 
saves PC (program counter) and PSR (process status register) to tiie stack, then reads a vector table from a trap table 
and branches to a routine corresponding to that trap. Concomitant with tiie generation of tiie triap. IE (Interupt enable) 
bit is cleared and tiie generation of siteequent maskable interrupts Is Inhibited. Maskable external interrupts are ena- 
bled again by using a load tnstructa'on with respect to the PSR to write 1 to tiie IE bit. 
35 The reti insti'uction is used to return from ttie trap processing routine to tiie original routine. When the reti insti'uction 
Is executed, the CPU reads ttie PSR and PC from the stack In ttiat order, restores tine PSR to its original value and also 
branches to ttie return address. Note that exceptions include debugging exceptions, address misalignment exceptions, 
overfk)w exceptions, and zero-division exceptions. 

Detailed desaiptions of ALR (arrthmetk; low register) 15 and ALH (arithmetic high register) 16 are omitted. 
40 The special registers PSR 13. SP 14, ALR 15. and AHR 16 of ttie CPU are capable of transferring data to and from 
the general-purpose registers, using load instruction. Each register has a special register number and Is accessed by 
using that nuntier. 



Special Register Name 


Special Register Number 


Assembler Mnemonic 


Process status register 


0 


%PSR 


SP 


1 


%SP 


Arithmetic low register 


2 


%ALR 


Arithmetic high register 


3 


%AHR 



55 

(3) Description of Stack and Stack Pointer 

The stack is a temporary storage region provided in memory. It Is a contiguous region In which data is written from 
the bottom up. as if stacked In a rack. The stack pointer indk:ates ttie address of ttie data at ttie top of the stack, ttiat is. 
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the data that was written most recently into the stack. 

The basic operation of the stack pointer will now be desaibed with reference to Fig. 3. 

Reference number 100 in Rg. 3 denotes a stack region provided in memory. A hatched portion 102 represents data 
that was most recently stored in this region, and a memory address 1000 indicates that data. Note that a region 104 
5 below the hatched portion 102 contains data that has already been stored and a portion 106 above the hatched portkm 
102 is available for storing data in the future. 

The stack pointer always Indicates a word boundary. This means that, when data Is written to the stack, the stack 
pointer within the SP 14 moves upward by 4 and the data is stored at the location indicated by that stack pointer. When 
data stored in the stack is fetched, it is the data at the address indicated by the current SP 14 that is fetched and the 
10 stack pointer stored in the SP 14 moves downward by 4. In this manner, the stack pointer always indicates the storage 
address of the data that was stored most recently in the stack. 

(4) Description of Group of Dedicated Stack Pointer Instructions 

IS A RISC CPU always uses a general -purpose register as the stack pointer. This embodiment has the SP 14, which 
is a dedicated stack pointer register, which is operated upon by the previously mentioned group of dedicated stack 
pointer Instructtons. 

This group of dedicated stack pointer instructions is a generic name for a number of instructions that access the 
SP 14 as an implicit operand and perform operations on the SP 14. The group of dedicated stack pointer instructions 
20 comprises instructions that branch to other routines, such as SP-relative load instructions (Id, etc.). stack pointer move 
instructions (add. sub), instructions for branching to subroutines (call, etc), and return instructions (ret, etc.); a sequen- 
tial push instruction (pushn); and a sequential pop instruction (popn). 

What these Instructions have in common is that they are dedicated stack pointer instructions so there is no need 
for data specifying the stack pointer in the object code. Another common feature is that they enable the efficient coding 
2S in short instructions of processes that use tiie stack pointer. 

The use of these instructions makes it possible to eff identiy process data that Is stored in a stack provkied in mem- 
ory. Interrupt and subroutine call/ireturn processes can also be perfamed efficiently. 

An example of the usage of these dedicated stack pointer instructk)ns when a subroutine is called will now be 
described with reference to Fig. 4. This figure also shows the usage state of the stack provided in memory and the state 
30 of the stack pointer. A MAIN program 500 and a subroutine 520 of Rg. 4 are written as object code created by a C com- 
piler. Reference number 540 denotes the state of a stack in memory. Reference number 502 indicates that processing 
is being performed, using the general-purpose registers RO to R3. Reference number 506 denotes a subroutine call 
instruction. Before the subroutine call instruction is executed by the MAIN program 500. in other words, after the previ- 
ous instruction indicated by 504 has been executed, the stack pointer (SP) points to an address <1 ) in the stack in mem- 
35 ory. When this subroutine call instruction is executed, control passes to the subroutine 520. At this point, an instruction 
for branching to a subroutine (call instruction), which is one of the dedicated stack pointer instructions of this embodi- 
ment, is executed. When this instruction is executed, the value of tiie stack pointer (SP) is automatically incremented 
by -4 (at (2) in Fig. 4) and the address for return to the MAIN program is stored in an area 544 of the stack that is indi- 
cated by the stack pointer <2 ) . 

40 When the subroutine 520 starts executing. It first transfers the values that are stored in the general-purpose regis- 
ters RO to R3 used by the MAIN program. Reference number 524 denotes a sequential push instruction (pushn) which 
is one of the dedicated stack pointer instructions that saves the values stored in the general -purpose registers RO to R3 
to the stack. When this instruction is executed, the values stored in tiie generaliourpose registers RO to R3 are trans- 
ferred in sequence to the stack, so tiiat they are stored in the stack 540 as shown at 550 in Rg. 4. When the execution 

45 of this process ends (524 (3 ) ). the stack pointer (SP) is at 546 ( <3 > SP). 

The subroutine 520 then secures an auto-variable region to be used t>y the subroutine. An add instruction denoted 
by 526 is a stack pointer move instruction which is a dedicated stack pointer instruction that causes the stack pointer to 
move upward to secure a stack region to be used by tiie subroutine 520. When this instruction is executed, the stack 
pointer (SP) moves upward by X bytes to a k)cation 548 (<4) SP) to secure an auto-variable region to be used by the 

50 subroutine, as Stated in Note 2. 

Reference number 528 denotes processing that uses auto-variables and the general-purpose registers RO to 
within the subroutine 520. At this point, the location of tiie stack pointer is as indicated by <5 > and the loading of tiie 
auto-variables is performed by using a dedicated SP load instruction that is one of the dedicated stack pointer instruc- 
tions. 

55 Reference number 529 denotes this SP load instruction that transfers an auto- variable SI that is stored in memory 
to the general-purpose register R1 . This auto-variable SI Is staed at a location that Is offset by Y bytes from the stack 
poMet ((5> SP). The stack pointer does not move during the above processing of the subroutine 528, so the memory 
address of the auto-variable is specified by the stack pointer plus the offset. This makes it possible to use the dedicated 
SP load instruction to fetch data to and from the general-purpose registers efficiently. 
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Before control returns from the subroutine 520 to the MAIN program 500. the values in the general-purpose regis- 
ters RO to R3 that were saved to the stack must be restored to the general-purpose registers RO to R3 and the stack 
pointer must be reset to indicate the area 544 that contains the address for return to the MAIN routine. This restores the 
stack pointer that was moved by the stack pointer move instruction of 526 back to its original location. A sub instruction 

5 indicated by 530 is a stack pointer move instruction which is a dedicated stack pointer Instruction that moves the stack 
pointer downward. When this instruction is executed, the stack pointer moves to 546 ((6) SP). The data 550 stored in 
the stack is restored to the general-purpose registers RO to R3. Reference number 532 denotes a sequential pop 
instruction (popn) which is a dedicated stack pointer instruction that transfers data in the stack to the general-purpose 
registers RO to R3. When this instruction is executed, the values 550 stored in the stack are sequentially transfened to 

10 the general-purpose registers RO to R3 so that they are stored in the stack 540 as shown at 550 in Fig. 4. When the 
execution of this process ends (532 <7)). the stack pointer indicates 544 ((7> SP). 

Reference number 534 denotes a return instruction. When this return instruction is executed, control passes to the 
MAIN program. In this case, a return instruction (ret Instruction) which is a dedicated stack pointer instruction is exe- 
cuted by this embodiment. When this instruction Is executed, control branches to an Instruction that indicates the 

IS address for return to the MAIN program, which Is stored in the stack area indicated by C7) SR In other words, the MAIN 
program 500 returns to the next instruction 507. The value of the stack pointer is automatically incremented by -44 to 
move It to the top area of the stack used by the MAIN program 500 (<8) SP In Rg. 4). 

The description now turns to details of each of the dedicated stack pointer instructions, as well as the circuit struc- 
ture required for executing these instructions and the operation during such execution. 

20 

(5) Stack Pointer (SP) Relative Load Instructions 

A C compiler creates object code that specifies that an auto-variable region is to be linked to the stack pointer. More 
specifically, the C compiler acts as means for securing an auto-variat^e region that is Y bytes long, at an offset of X from 
25 the Stack pointer. 

A view used for illustrating the process of transfening auto-variables in memay to registers is shown in Fig. 5. In 
this figure, auto-variable a Is woid data, auto-variables b and c are each half-word data, and auto-variables d to g are 
each byte data. The stack pointer stored In the SP indicates 1 000, which is the address of the top region of a region 600 
of the stack in which auto-variables are stored. An area of the stack at the menfx>ry address indfoated by the stack 

30 pointer is secured as an area for holding the auto-variable a. Areas in the stack at memory addresses indicated by the 
stack pointer plus 2 and the stack pointer plus 4 are secured as areas for hokling the auto-variables b and c, respec- 
tively. Similarly, areas in the stack at memory addresses Indicated by the stack pointer plus 5. the stack pointer plus 6, 
the stack pointer plus 7, and the stack pointer plus 8 are secured as areas for holding the auto-variables d to g. respec- 
tively. During the executfon of a given process, it may be necessary to transfer data between a general-purpose register 

3S and an auto-variable which is specified by a memory address that is the stack pointer plus an offset. 

To enable the CPU of this embodiment to execute the above transfer processing efficiently with shorter object code, 
the following instructfon set is provkled as SP-relative bad instructions that are some of the dedicated stack pointer 
instructfons: 

40 ld.b%Rd.[%spfimm6l(1) 

ld.ub %Rd. [%sp+imm6] (2) 

ld.h%Rd.[%spfimm7)(3) 

W.uh %Rd, |%sp+imm7] (4) 

id.w%Rd,[%sp+imm8](5) 
45 ld.b[%8pfimm6],%R8(6) 

ld.h [%sp4-imm7], %Rs (7) 

id.w [%sp+imm8], %Rs (8) 

Instructions (1) to (8) are instruction codes created by an assembler. Instruction (1) sign-expands byte data and trans- 
50 fers it from the stack to a register, instruction (2) zero-expands byte data and transfers it from the stack to a register. 
Instruction (3) sign-expands half-word data and transfers it from the stack to a register, instruction (4) Zero-expands 
half-word data and transfers it from the stack to a register, instruction (5) transfers word data from the stack to a register, 
instructfon (6) transfers byte data from a register to the stacK instruction (7) transfers half-word data from a register to 
the stacK and instruction (8) transfers word data from a register to the stack.' 
55 The operands [%sp4-imm6]. [%sp4-imm7]. and [%sp44mm8] each represent immediate offset data. A memory 
address is aeated by adding an offset that is created during the execution of the instruction on the basts of this imme- 
diate offset data and the value of the stack pointer stored in the SP 1 4. [%sp+imm6] is immediate offset data used when 
the size of the data in memory is bytes. [%sp+imm7] is that when the data size is half-words, and [%sp^imm8] is that 
when the data size is words. Whichever size of data is used, it is written as 6-bit immediate offset data 614 in the object 
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ccxje. as will be described later with reference to Fig. 6A. If imm7 Is used (in other words, if the data size is half-words), 
however, the Immediate offset data 614 Is shifted one bit to the left during the execution of the Instruction, to generate 
an offset to be added to the stack pointer. Similarly, if imm8 is used (In other words, if the data size is words), the Imme- 
diate offset data 614 is shifted two bits to the left, to generate an offset to be added to the stack pointer. 

5 In this case, the stack Is a temporary storage region provided in memory and the above memory addresses can be 
used to specify the locations of. for example, auto-variables (a to g) within the stack, as shown In Fig. 5. 

An example of a bit field 610 of these SP-relative load Instructions (1) to (8) is shown in Rg. 6A. The SP-relative 
toad instruction of Fig. 6A has 16 bits of object code comprising op code 612 (6 bits) indicating that the operating func- 
tion is a transfer of data between memory and a general-purpose register, immediate offset data 614 (6 bits) specif red 

10 by immediate data, and a register number 61 6 (4 bits) specifying the register that is to be used for the transfer. The op 
code 612 comprises common code indicating that this is a load instruction thiat acts upon the SP 14 and different codes 
specifying each of data size, sign expansion, and zero expansion. Since It Is therefore dear from the op code that these 
instructions act upon the stack pointer stored in the SP 14. there is no necessity to specify data relating to the stack 
pointer in the operands of the object code. The immediate offset data 614 is used for aeating an offset from the stack 

15 pointer for data that is to be transfen-ed. This is specified in 6 bits, regardless of the data size, as described above. With 
Instructions (1) to (5). the address (register number) 616 of the register that Is Involved In the transfer will contain the 
number of the register to contain data that lias been read out from the stack: with Instructions (6) to (8). It will contain 
the number of the register that currently contains data to be written to the stack. 

By way of comparison. Fig. 6B shows an example of a bit field 620 of object code of a load instruction that acts upon 

20 a general-purpose register when another general-purpose register is used as the stack pointer (hereinafter called a 
general-purpose load instruction). 

The general-purpose k>ad instruction of Rg. 6B has 20 bits of object code comprising op code 622 (6 bits) Indicat* 
Ing that the operating function Is a transfer of data between memory and a general-purpose register, immediate offset 
data 624 (6 bits) specified by immediate data, a first register number 620 (4 bits) specifying the general-purpose regis- 

25 ter to be used as the stack pointer, and a second register number 628 (4 bits) specifying the register that is to be used 
for the transfer. With a general-purpose microcomputer, the instruction length is in S-bit units, which gives 24-bit or 32- 
bit Instructions. 

Each of Rgs. 6A and 6B shows object code for an Instruction used when transferring data between a register and 
a memory address specified by adding an offset to the stack pointer. It is dear from these examples that a SP-relatlve 

30 load instruction can be written with shorter object code than a general-purpose load instruction. 

The description now turns to the configuration required for executing the at>ove instructions and the operations that 
occur during this execution, taking as examples the instruction (5) that transfers word data from the stack to a register 
(since this Instruction reads word data from the stack to a register, it is hereinafter called an SP-reiatlve load Instruction 
for reading word data) and the instructbn (8) that transfers word data from a register to the stack (since this instruction 

35 writes word data from a register to the stadc, it Is hereinafter called an SP-relative load instruction for writing word data). 
The hardware configuration necessary for executing the instructions will be described first, using Fig. 1 for refer- 
ence. Each of these Instructions Is transferred over the l_DATA_BUS 94 from an external memory (ROM) 52 and is 
Input to the Instruction decoder 20. The instruction Is decoded by the instruction decoder 20. which outputs various sig- 
nals (not shown in the figure) necessary for executing the Instruction. The immediate data generator 22 performs a left- 

40 ward logical shift on the immediate offset data 61 4 in accordance with the data size, generates the offset used during 
the execution of sign expansion or zero expansion, if necessary, and outputs the results to the PB_BUS 74. The SP 14 
contains the stack pointer, and that value can be output to tiie XA.6US 78 which is connected to an input of the address 
adder 30. Another input of the address adder 30 is connected to the PB.BUS 74 that Is an output of the Immediate data 
generator 22. An ou^ut (ADDR) of the address adder 30 Is connected to the extemal D.ADDR.BUS 96 by the DA slg- 

45 nalllne84« 

BCU (bus control unit) 60 controls the input and output of data to and from memory (RAM and ROM) 50 and 52 
(which includes the stack area) and outputs READ and WRITE control signals in accordance with varbus request sig- 
nals that are output from the GPU (such as signals output to external buses). 

The description first concerns the operation during the execution of an SP-relative load instruction for reading word 
so data. 

When the SP-relative load Instruction for reading word data is executed, the value of the stack pointer stored In the 
SP 1 4 is added to the offset created by the immediate data generator 22 on the basis of the immediate offset data 61 4, 
to create a memory address for reading. Data is read from memory on tiie basis of this memory address and is trans- 
fenred to the general-purpose register specified by tiie register number 61 6 in the object code. 
55 A flowchart Illustrating the operation during the SP-relative load instruction for reading word data Is shown in Fig. 7. 

At the start of execution of this instruction, the stack pointer stored in tiie SP 14 Is output to the XA.BUS 78 (step 
S210). An offset imm created by the immediate data generator 22 from tiie immediate offset data is then output to the 
PB.BUS 74 (step S212). The address adder 30 adds the value on the XA.BUS 78 and the value on the PB.BUS 74. 
then outputs the resultant memory read address (ADDR) over the DA signal line 84 to the D.ADDR.BUS 96 (steps 



14 



EP0 809 180 A2 



S214 and S216). A data read request signal from the CPU to the BCD 60 becomes active and executes an external 
memay read cycle (step S218). In other words, the BCU 60 follcws the request signal to control the reading of data 
from menwry, using this read address as a memory address, and the output of the data to the D_DATA_BUS 98. The 
data on the D_DATA_BUS 98 is output to the WVV_BUS 76 over the DIN signal line 86 (step S220). The value on the 
5 WW_BUS 76 Is stored in the register (%Rd) having the register number specified by the 4-blt address 61 6 of the regis- 
ter (Ra/Rs) that is specified in the instruction code for the transfer (step S222). 

The desaiption now turns to the operation during the execution of an SP-relative load Instruction for writing word 
data 

When the SP -relative toad instruction for writing word data is executed, the value of the stack pointer stored in the 
10 SP 1 4 is added to the offset created by the immediate data generator 22 on the basis of the immediate offset data 61 4. 
to create a memory address for writing to memory. The data stored in the general-purpose register specified by the reg- 
ister number 616 in the object code is transferred to the areas in memory specified by that memory address. 

A flowchart illustrating the operation during the SP-relative load instruction for writing word data is shown in Fig. 8. 
At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 (step 
IS S230). An offset imm created by the immediate data generator 22 from the immediate offset data is then output to the 
PB.BUS 74 (step S232). The address adder 30 adds the value on the XA.BUS 78 and the value on the PB.BUS 74, 
then outputs the resultant memory write address (ADDR) over the DA signal line 84 to the D_ADDR_BUS 96 (steps 
S234 and S236). The data stored in the register (%Rd) having the register number specified by the 4'bit address 616 
of the register (Rs/Rd) that is specified in the instruction code for the transfer is output to the RAMBUS 72 (step S238). 
20 The data on the PA_BUS 72 is output to the D_DATA_BUS 98 over the DOUT signal line 88 (step S240). A data write 
request signal from the CPU to the BCU 60 becomes active and executes an external memory write cycle (step S242). 
In other words, the BCU 60 follows the request signal to control the operation of wrKing to memory the data that has 
been transferred to the D_bATA_BUS 98. using this write address as a memory address. 

25 (6) Stack Pointer Move Instruction 

Views used to illustrate the usage state of the stack in memory and the state of the stack pointer during each rou- 
tine of a program that extends over a plurality of routines are shown in Figs. 9A to 9F. 

The states of the stack region in memory and the stack pointer (the value stored in the SP 1 4) during the execution 
30 of a given process 'a' 21 1 of a MAIN routine 210 of Fig. 9A are shown in Fig. 9D. Reference number 222 denotes a stack 
region that is to be used by the MAIN routine 210, and the stack pointer indicates a higher address 232 of the region 
222. 

A SUB1 routine 212 of Rg. 9B is a subroutine that Is called and executed from the MAIN routine 210. The states of 
the stack region in memory and the stack pointer (ttie value stored In the SP 14) during the execution of a given process 
35 t>' 213 of the SUB1 routine 212 are shown in Fig. 9E. Reference number 224 denotes a stack region that is to be used 
by the SUB1 routine 212, and the stack pointer indicates a higher address 234 of the region 224. 

A SUB2 routine 214 shown in Fig. 9C is called and executed from the SUB1 routine 212. The states of the stack 
region in memory and the stack pointer (the value stored in the SP 14) during the execution of a given process 'c' 215 
of the SUB2 routine 214 are shown In Rg. 9F. Reference number 226 denotes a stack region that Is to be used by the 
40 SUB2 routine 214. and the stack pointer indicates a higher address 236 of \he region 226. 

When execution extends over a plurality of subroutines in this manner, the stack region used by each routine 
moves, so that the value of the stack pointer is moved as appropriate to tiie top of the stack region used by each routine. 

To enable the CPU of this embodiment to execute this movement of the stack pointer effidently with shorter object 
code, the following instruction set is provided as stack pointer move instructions that are some of tiie dedicated stack 
45 pointer instructions: 

add%sp, imm12 (9) 
sub%sp, imm12(10) 

so Instructions (9) and (10) are Instruction codes aeated by an assembler. Instruction (9) is an immediate data add 
instruction for the stack pointer stored in the SP 14 and instruction (10) is an immediate data subtract Instruction for this 
stack pointer. The operand imm12 is 32-bit data obtained by shifting the lO-bit immediate data by two bits leftward then 
subjecting it to zero expansion. It is used for operations with the stack pointer that is stored in the SP 14. 

A bit i\e\d 630 of the stack pointer move instructions (9) and (10) is shown in Fig. 10A. The stack pointer rixsve 

55 lnstructk)n of Fig. 1 0A has 16 bits of object code comprising op code 632 (6 bits) Indicating whether the move data for 
the stack pointer stored in the SP 14 is an addition or subtraction and immediate data 634 (10 bits). This op code 632 
comprises common code Indicating that this is a stack pointer move instruction that acts upon the SP 14 and different 
codes specifying eitiier addition or subtraction. Since it is tiierefore clear from the op code that these instructions oper- 
ated upon the stack pointer stored in ttie SP 14, there is no necessity to specify data relating to the stack pointer in ttie . 
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operands of the object code. The immediate offset data 634 is used for aeattng an offset for subtracting from or adding 
to the stack pointer. 

By way of comparison, Fig. 10B shows an example of a bit field 640 of object code of an add/subtract instruction 
when a general-purpose register is used as the stack pointer, (hereinafter called a general-purpose calculation instruc- 
5 tion). 

The generalisurpose calculation instruction of Fig. 10B has 20 bits of object code comprising op code 642 (6 bits) 
indicating that the operating function is a addition or subtraction of immediate data to or from a value in a generaliDur- 
pose register, immediate data 644 (10 bits), and a register number 646 (4 bits] specifying the register that is to be oper- 
ated upon. With a general-purpose miaocomputer, the instruction length is in 8-bit units, which gives 24-bit or 32-bit 
10 Instructions. 

Each of Rgs. 10A and 10B shows object code for an instruction used when adding or subtracting Immediate data 
to or from the stack pointer, tt is clear from these examples that a stack pointer move instruction can be written with 
shorter object code than a general-purpose calculation instruction. 

The description now turns to the configuration required for executing the above instructions and the operations that 
IS occur during this execution, taking as an example the instruction (9) that adds immediate data to the SP (hereinafter 
called an add stack pointer move instruction) and the Instruction (10) that subtracts immediate data from the SP (here- 
inafter called a subtract stack pointer move instruction). 

The hardware configuration necessary for executing the instructions will be described first, using Fig. 1 for refer- 
ence. Each of these instructions is transferred over the l_DATA_BUS 94 from an external memory (ROM) 52 and is 
20 input to the instruction decoder 20. The instruction is decoded by the instruction decoder 20. which outputs various sig- 
nals (not shown in the figure) necessary for executing the instruction. The immediate data generator 22 performs a log* 
leal shift by two bits to the left on the 10 bits of the immediate data 634. subjects it to zero expansion, and outputs It to 
the PA_BUS 72. The SP 14 contains the stack pointer, and that value is output to the XA_BUS 78. The XA^BUS 78 is 
connected to the PB_BUS 74 which forms an input of the ALU 40. Another Input of the ALU 40 is connected to the 
25 PA_BUS 72 that is an output of the immediate data generator 22. The output of the ALU 40 Is connected to the 
WW_BUS 76. This WW_BUS 76 is connected to the input of the SR 

The desaiption first concerns the operation during the execution of the add stack pointer move instructbn. 

When the add stack pointer move instruction Is executed, the value of the stack pointer stored In the SP 1 4 Is added 
to the offset created by the Immediate data generator 22 on the basis of the immediate offset data 634. to create a new 
30 stack pointer. That value is stored in the SP 1 4. 

A f bwchart illustrating the operation during the add stack pointer move instruction is shown in Fig. 1 1 . 

At the start of execution of this instruction, the stack pointer stored in the SP 14 Is output to the XA.BUS 78 (step 
S250). The data on the XA.BUS 78 is output to the PB.BUS 74 (step S252). Move Immediate data imm created by the 
immediate data generator 22 on the basis of the Immediate data Is output to the PA.BUS 72 (step S254). The ALU 40 
35 adds the value on the PB_BUS 74 and the value on the PA_BUS 72, and outputs the result to the WW_BUS 76 (step 
S256). The value on the WW_BUS 76 is then input to ttie SP 1 4 (step S258). The description now turns to the operation 
during the execution of the subtract stack pointer nxyve instruction. 

When the subtract stack pointer move Instruction Is executed, the move immediate data created by the immediate 
data generator 22 on the basis of the immediate data 634 is subtracted from the value of the stack pointer stored In the 
40 SP 14 to create a new stack pointer. That value is stored in the SP 14. 

A flowchart illustrating tiie operation during the sut>tract stack pointer nx)ve instruction is shown in Fig. 1 2. 

At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA.BUS 78 (step 
S260). The data on the XA.BUS 78 Is then output to the PB.BUS 74 (step S262). Move Immediate data imm created 
kiy the immediate data generator 22 on the basis of the Immediate data is output to the PA_BUS 72 (step S264). The 
45 ALU 40 subtracts the value on the PA^BUS 72 from the value on the PB_BUS 74. and outputs the result to tiie 
WW^BUS 76 (step S266). The value on the WW_BUS 76 is then input to the SP 14 (step S268). 

(7) Branch Instruction 

50 A view illustrating program execution control during tiie call and ret instructions is shown in Rg. 13. During a MAIN 
routine 300 shown in Fig. 13. a call Instruction (302) is executed to branch to a subroutine SUB 310, so that control 
passes to tiie subroutine SUB 310. A ret instruction (312) is placed at the end of the subroutine. When this instruction 

.... Js executed, the flow returns to the next the instruction (304) after the call instruction (302) of the MAIN routine 300. 
Therefore, the program Is executed in the sequence <1), (2), then <3>lnRg. 13. At the end of the execution of this 

55 subroutine SUB 310. control returns to the MAIN routine 300 and execution Is to continue from the next instruction (304) 
after the call instruction (302). This makes it necessary to record the address of the return destination somewhere, 
when a branch occurs to the subroutine SUB 310. For that reason, the return destination address is saved to the stack 
as shown in Rg. 4 during the execution of a branch instruction that branches to a subroutine, such as the call instruc- 
tion, tiien the return destination address Is returned from tiie stack to the program counter during the execution of a 
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branch instruction that returns from a subroutine, such as the ret Instruction (this process is hereinafter called saving 
and restoring the program counter). 

With a prior-art RISC CPU. the process of saving and restoring the program counter is done by software, so that it 
is necessary to provide object code (assembler instructions) for executing this process during the execution of a branch 
5 instruction such as the call instruction. For example, it is necessary to provide object code (assembler instructions) for 
decrementing the stack pointer by an amount equivalent to the word size (4) during the execution of the call instruction, 
to place the address of the next instruction after the call instruction in the stack on the basis of the value in the program 
counter. 

The CPU of this embodiment, however, has a hardware configuration that enables the saving and restoration of the 
10 program counter when the call or ret instruction is executed. It is therefore not necessary to provide object code (assem- 
bler Instructions) that saves and restores the program counter separately from the object code of the call and ret Instruc- 
tions. 

To enable the CPU of this embodiment to execute this process of saving and restoring the program counter by a 
single instruction, the following instruction set is provided as branch instructions that are some of the dedicated stack 
IS pointer instructions: 

callsign9(11) 
call %Rb(12) 
ret (13) 
20 reti(14) 
retd (15) 
intimm2(16) 
brk(17) 

2S Instructions (1 1) to (17) are instruction codes created by an assembler. Instruction (11) is a PC-relative subroutine call 
instmction which uses the program counter (PC) as a base address, to branch in a relative manner to a branch desti- 
nation address on the basis of the displacement data sign9 specified as the operarid. Instructk)n (12) is a register indi- 
rect subroutine call Instruction which branches to the branch destination address contained within the register specified 
as the operand. Instruction (13) is an instruction for return from a subroutine. Instruction (14) is an instruction for return 

30 from an interrupt or exception processing routine. Instruction (15) is an instruction for return from a debugging process- 
ing routine. Instruction (16) is a software interrupt instruction. Instruction (17) is a software/debugging interrupt instruc- 
tion. 

An example of a bit f ieki 650 of the PC-relative subroutine call Instruction (1 1) Is shown in Fig. 1 4. The PC-relative 
subroutine call instruction of Rg. 14 has 16 bits of object code comprising op code 652 (8 bits) indicating that the oper- 
as ating function is a call instruction tiiat branches to a subroutine with a branch destination address that is specified rela- 
tively, using the program counter as a base address, and displacement data signs (8 bits) 654 specified by immediate 
data. During the execution of this command, the 8-bit immediate data is shifted logically by one bit to the left then is sub- 
jected to sign expansion. 

The CPU of this embodiment is capat)le of saving the program counter to the stack at the execution of the call 
40 Instruction, with only the object code shown in Fig. 1 4. 

The description now turns to the configuration required for executing ttie above insfuctions and the operations that 
occur during tiiis execution, taking as examples the PC-relative subroutine call instruction (11) as an instruction for 
branching to a subroutine and the retum Instruction (13) as an instruction for returning from the subroutine. 

The hardware configuration necessary for executing the instructions will be described first using Fig. 1 for refer- 
45 ence. Each of these instructions is transferred over the LDATA.BUS 94 from ecternal memory (ROM) 52 and is input 
to the instruction decoder 20. The instruction is decoded by the instruction decoder 20. which outputs various signals 
(not shown in the figure) necessary for executing the instruction. The immediate data generator 22 performs a logical 
shift by one bit to ttie left on the displacement data 654. subjects it to sign expansion to create a 32-bit immediate data 
displacement imm. then outputs the result to the PB_BUS 74. The SP 14 contains ttie stack pointer, and that value is 
so output to the XA.BUS 78 which is connected to an input of the address adder 30. Another input of the address adder 
30 is connected to tiie PB.BUS 74 that is an output of the Immediate data generator 22. An output (ADDR) of the 
address adder 30 is connected to the external LADDR.BUS 92 by the lA signal line 82. 

The LADDR^ BUS 92 and LDATA_BUS 94 are connected to the ROM 52 in which is stored the instruction object 

code. The bus control unit (BCU) 60 outputs READ control signals that read tinis instruction object code from the mem- ' 
55 ory (ROM) 52 in accordance with various request signals that are output from the CPU (such as signals output to exter- 
nal buses). 

The description first concerns ttie operation during the execution of an PC-relative subroutine call instruction. 
When the PC-relative subroutine call instruction is executed, the value of the program counter stored in the PC 12 
Is saved to the stack as described wHh reference to Rg. 4. and the value of the stack pointer stored In the SP 1 4 is dec- 
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remented by the word size (4). A branch destination address obtained by adding the program counter and the 32-bit 
immediate data displacement is set in the PC 12. 

A flowchart illustrating the operation during the PC-relative subroutine call Instruction is shown in Fig. 15. 

At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 (step 

5 S270). This forms one input of the address adder, and constant data (-4) created by tiie immediate data generator 22 
forms the other Input of the address adder 30. The constant -4 Is added to the value of the stack pointer on the XA_BUS 
78 by the address adder 30 to create a write address to the stack for storing the return address, and the result is output 
to the WW_BUS 76. This write address is output to the D_ADDR_BUS 96 (step S272). The value of the program coun- 
ter stored In the PC 12 is incremented by 2 by the PC incrementer 44 to create the return address, and the result is 

10 output to the D_DATA_BUS 98 over the DOUT signal line 88 (step 8274). A data write request signal from the CPU to 
. the BCD 60 becomes active and executes an external memory write cycle (step 8276). In other words, the BCU 60 fol- 
lows the request signal to store this return address in the stack provided in memory, using this write address as a mem- 
ory address. 

The value on the WW_BUS 76 Is then output to the SP 1 4 (step S278). In other words, the value of the stack pointer 
IS Is updated by being decremented by 4. 

The program counter stored In the PC 12 is then output to the XA_BUS 78 (step S280). The immediate data gen- 
erator 22 performs a logical shift by one bit to the left on the displacement data 654, subjects it to sign expansion to 
create a 32-bit immediate data displacement Imm, then outputs the result to the PB_BUS 74, The address adder 30 
adds the program counter on the XA_BUS 78 and the immediate data displacement imm on the PB_BUS 74 to create 
20 a branch address (ADDR), then outputs that address over the lA signal line 82 to the LADDR_BUS 92 (step S282). An 
instruction read request signal from the CPU to the BCU 60 becomes active and executes a read cycle with respect to 
the external memory (ROM) 52 to fetch the object code of the branch destination instructk>n (step S284). 

The operation during tiie execution of the ret instruction will now be described. 

When the ret instruction is executed, the value of the program counter that was saved to the stack is restored to the 
25 PC 12. and the value of the stack pointer stored In the SP 14 Is Incremented by the word size (4). as described with 
reference to Fig. 4. 

A flowchart Illustrating the operation during the ret instruction Is shown in Rg. 16. 

Before this Instruction is executed, the stack pointer stored in the SP 14 indicates the address in the stack contain- 
ing the destination address for return from the called routine. 

30 At the start of execution of this instruction, the stack pointer stored in the SP 14 is output to the XA_BUS 78 (step 
S290). The stack pointer on the XA_BUS 78 is then output to the D_ADDR_BUS 96 (step S292). A data read request 
signal from the CPU to the BCU 60 becomes active and executes an external memory read cycle. In otiner words, the 
BCU 60 follows the request signal to read the return address from the stack provided In memory, using the stack pointer 
as a memory address. This return destination address that has been read from the memory (RAM) 50 Is fetched within 

35 the CPU from tiie D_OATA_BUS 94 over the DIN signal line 86. then is output from the DIN signal line 86 to the 
LADDR.BUS 92 over the lA signal line 82 (step S294). An instruction read request signal from the CPU to the BCU 60 
becomes active and executes a read cycle with respect to the external menriory (ROM) 52 to fetch the object code of 
the return destination Instruction (step S296). 

The value of the stack pointer on the XA_BUS 78 Is one Input of the address adder 30. and constant data (+4) ere- 

40 ated by tiie immediate data generator 22 forms the otiier input of the address adder 30. The constant +4 is added to 
the value of the stack pointer on the XA_BUS 78 by the address adder 30 to aeate tiie address of the top area of the 
stack region secured by the return destination routine, and the result is output to the WW_BUS 76 (step S298). This 
address on the WW.BUS 76 (the address of the top area of the stack region secured by the return destination routine) 
Is then output to the SP 1 4 (step S300). 

45 

(8) Description of Sequential Push Instruction (pushn) and Sequential Pop Instruction (popn) 

As described previously, it has recentiy become common to have a configuration with a large number of internal 
generaliXJrpose registers, particularly in RISC CPUs, to enable processes to run rapidly witiiin the CPU itself, witiiout 
so having to access memory The CPU of this embodiment Incorporates 16 general-purpose registers, designed to 
Increase ttie speed of processing. However, if there are so many Internal registers, the contents of a large number of 
registers have to be saved during processing that saves and restores registers when an intenupt occurs or a subroutine 
is called: 

When the conlents of these registers are saved or restored in the prior art. a push instruction for storing the con-'""" 
55 tents specified by an address or a pop instruction for fetching the contents of the stack to a register is used. The actions 
of these ordinary push and pop Instructions will now be descrbed. 

The actions performed during the execution of push instructions are shown schematically in Figs. 1 7A and 1 7B and 
the actions during the execution of pop instructions are shown schematically in Figs. 18A and 18B. The actions per-, 
formed when data is transferred between a plurality of general-purpose registers and the stack will now be described. 
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with reference to Figs. 1 7A. 1 7B, 18A. and 1 8B. 

Fig. 1 7A shows the actions when a *push R1 ' instruction is executed, to write the contents of a general-purpose reg- 
ister R1 to the stack. During the execution of this instruction, the contents of the SP 14 are updated by the subtraction 
of 4 from the current value (1 000 in Fig. 1 7A is updated to 996). The contents 'a' of the general-purpose register R1 are 
5 written to 996. which Is the memory address indicated by the updated stack pointer In the SP 1 4. 

Fig. 1 7B shows the actions when a 'push R2* instruction is executed to further write the contents of a general-pur- 
pose register R2 to the stack. During the execution of this instruction, the contents of the SP 14 are updated by the sub- 
traction of 4 from the cun^ent value (996 in Rg. 1 7B is updated to 992). The contents b* of the general-purpose register 
R2 are written to 992. which is the memory address indicated by the updated stack pointer in the SP 14. 
10 Fig. ISA shows the actions when a 'pop R2' instruction is executed to fetch the contents of the stack to the general- 
purpose register R2. During the execution of this instruction, the contents t)' stored at the memory address 9921 Indi- 
cated t)y the stack pointer in the SP 14 are fetched and stored in the general-purpose register R2. The contents of the 
SP 14 are updated by the addition of 4 to the cun-ent value (the pre-execution value of 992 in Rg. ISA is updated to 
996). 

75 Finally. Fig. 1 8B shows the actions when a 'pop R V is executed to further fetch the contents of the stack to the gen- 
eral-purpose register R1. During the execution of this instruction, the contents 'a* stored at the memory address 996 
indicated by the stack pointer in the SP 14 are fetched and stored In the general-purpose register R1. The contents of 
the SP 14 are updated by the addition of 4 to the current value (the pre-execution value of 996 in Rg. 18B is updated 
to 1000). 

20 When data is transferred between a plurality of general-purpose registers and the stack in this example of the prior 
art. It Is necessary to execute push or pop instructions a.plurality of times. This is because each execution of a push or 
pop instructton can only operate on a single register. 

Therefore, when processing is performed to save data from a number of registers to the stacK or recover data from 
the stack to a number of registers, the resultant increase in the number of instructtons will lead to an Increase in the size 
25 of the object code. The number of execution steps in the program will also increase, leading to delays in the execution 
time Of processing of the program. 

The following Instruction set is provided for the CPU of this embodiment: 

pushn%Rs(18) 
30 popn%Rd(19) 

Instruction (18) is a sequential push instruction created by an assembler. It sequentially pushes the contents of n gen- 
eral-purpose registers (where n is an integer from 1 to 16) from %Rs to RO onto the stack. Instruction (19) Is a sequen- 
tial pop instructbn created by an assembler. It sequentially pops n Hems of word data (where n is an integer from 1 to 

35 16) from the stack and pushes them into general-purpose registers %Rd to RO. Each of the pushn and popn instruc- 
tions consists of op code and an operand. The operand %Rs of the pushn instruction indicates the register number of 
Rs when data is to be written to the stack from registers %Rs to RO. The operand %Rd of the popn instruction indicates 
the register number of Rd when data is to be taken from the stack and written to registers RO to %Rd. 

The bit structure of the pushn and popn instructions is shown in Rg. 19. The lowermost 4-bft field contains a code 

40 irxjicating %Rs or %Rd; any of the 16 general-purpose registers can be specified. When data is to be transferred 
between a special register and the stack, it is transferred via a general-purpose register. 

A block diagram of the hardware configuration required for executing the sequential push instruction (pushn) or 
sequential pop instruction (popn) is shown In Rg. 20. Portions necessary for the desaiption of the sequential push 
instruction (pushn) and sequential pop instruction (popn) are extracted from Fig. 1, and further necessary portions are 

45 added. Portions that are the same as those in Rg. 1 are given the same reference numbers. In this figure, reference 
number 11 denotes 16 general-purpose registers called RO to R15. These general-purpose registers 11 transfer data 
to and from the D_DATA_BUS 98. A register selection address signal 54 for selecting a register comes from a control 
circuit block 45. The stack pointer is stored in the SP 14. The value in the SP 14 can be output to the D_ADDR_BUS 
96 and the internal address bus (XA_BUS) 78 that is connected to an input of the 32-bit address adder 30. Another 

so Input of the address adder 30 Is connected to an offset signal 24 that is output from the control drcuit block 45. An out- 
put of the address adder 30 Is hekl by a latch (Add.LT) 32. then is further output to the XA_BUS 78 or the WW.BUS 
76. The WW_BUS 76 is connected to the input of ttie SP 14. There is a 4-bit counter (countx) 46 in the control circuit 
block 45. to count the number of registers involved in the data transfer. The control circuit block 45 is further provided 
with an instijciioii register (not shown in the figure), which hokls the operand %Rs or %Rd of the pushn or popn instruc- 

55 tlon and also outputs control signals in accordance with Instructions from the instruction decoder. In this figure, refer- 
ence number 60 denotes a bus control unit (BCD) that controls the input and output of data to and from external 
memory (RAM) 50, which includes the stack area and outputs READ and WRITE control signals. 

The description first concerns the operation during tiie execution of a sequential push instruction (pushn). 
When the sequential push instruction (pushn) Is executed, the contents stored in the general-purpose registers are 
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sequentially pushed into the stack starting with the register with register number %Rs to general-purpose register RO, 
as described previously with reference to Rg. 4. 

A flowchart Illustrating the operation during the pushn instruction is given in Fig. 21 . 

At the start of execution of the pushn instruction, the offset of a offset signal 24 is -4. The counter (countx) 46 is 
5 cleared to zero and the stack pointer stored in the SP 1 4 is output to the XA_BUS 78 (step S100). 

The address adder 30 then adds -4 to the value on the XA_BUS 78 and inputs the result to the latch (Add_LT) 32 
(stepSIOI). 

The value in the latch (Add_LT) 32 is then output to the D_ADDR_BUS 96. The difference between the value of 
%Rs held in the lowermost 4 bits of the instruction register and the value in counter (countx) 46 is calculated by the con- 
10 trol circuit block 45 and output to the register selection address signal 54. The register selected by this signal 54 is 
linked to the.D_DATA_BUS 98 and a write is performed with respect to the external memory (RAM) 50 (step S102). 

In a step S103, the counter (countx) 46 is compared with %Rs. If they match, the writing of registers to external 
memory has been completed and the execution of the pushn instruction ends by the writing of the value in the latch 
(MdJJ) 32 to the SP 14 through the WW_BUS 76 (step S104). 
IS If they do notmatch. the counter (countx) is incremented by 1, the value in the latch (Add.LT) 32 is output to the 
XA.BUS 78. and the processing from step SI 01 onward is repeated (step SI 05). 

The operation during the execution of a sequential pop instructton (popn) will next be descn'bed. 
When a sequential pop instruction (popn) is executed, the contents of the stack are sequentially pushed from the 
general-purpose register RO to the general-purpose register with register number %Rd, as described above with refer- 
20 ence to Rg. 4. 

A flowchart illustrating the operation during the popn instruction is shown in Fig. 22. 

At the start of execution of the popn instruction, the offset of a offset signal 24 is 44. The counter (counbc) 46 is 
cleared to zero and the stack pointer stored in the SP 14 is output to the XA BUS 78 and the D_ADDR_BUS 96 (step 
S110). 

2S The address adder 30 then adds 4 to the value In the XA.BUS 78 and inputs the result to the latch (Add LT) 32 
(step S1 11). 

An external memory read cyde is then performed. The thus-read data is written to the general-purpose registers 
1 1 through the D_DATA_BUS 98. During this time, the counter (countx) 46 is output to the register selection address 
signal 54 by the control circuit bk)ck 45 (step S1 12). 

30 In a step Si 1 3, the counter (countx) 46 is compared with %Rs. If they match, the reading from external memory to 
registers has been completed and the execution of the popn instruction ends by the writing of the value in the latch 
(Add_LT) 32 to the SP 1 4 through the WW_BUS 76 (step Si 1 4). 

If they do not match, the counter (countx) is inaemented by 1. the value in the latch (MdJJ) 32 is output to the 
XA_BUS 78 and the D_ADDR_BUS 96. and the processing from step S1 11 onward is repeated (step S115). 

35 Thus the 'pushn %Rs* instruction makes it possible to write data from registers %Rs to RO to the stack and the 
•popn %Rd' instruction makes it possible to return the necessary number of data items from the stack to registers RO to 
%Rd. Thus data from registers R3 to RO can be pushed by the execution of 'pushn %Rs*. for example. 

A further increase in efficiency is achieved in this case by the addition of a restriction over the way in which registers 
are used. In other words, the saving and restoration of register is particularly necessary when a program branches to 

40 another routine, such as during interrupt processing or when a subroutine is called. It is preferable that each routine 
called in such a case, such as another interrupt routine or subroutine, uses the registers In sequence from RO. This 
ensures that the contents of registers can be efficiently saved or restored by using the pushn instruction or popn instruc- 
tion, without having to include registers from RO to Rd or Rs that contain data tiiat does not need to be saved. All of the 
registers in this embodiment have the same functions, and there are no other restricttons on the usage of registers and 

4S the instruction, so that this restriction solves the above problem without causing further problems. 

Therefore, use of the pushn instruction and popn instruction of this embodiment makes it possible to implement effi- 
cient saving of data from registers to the stack in memory and the restoration of data from the stack to registers by a 
single instruction of either pushn or popn. This minimizes the object code size and the number of program execution 
steps, and also means that the number of execution cydes can be reduced to a single instruction fetch and the neces- 

50 sary number of data transfers, enabling a minimization of tiie number of cydes. This makes it possible to increase the 
speed of inten^upt processing routines and subroutines. 

Since the sti-uctural elements required for this execution are a count means and a simple sequence controller, it is 
possible to implement a data processing circuit with a small number of gates, so that this invention can be applied to a 
one-chip microconputef. 

55 The above description concerned the configuration required for executing the sequential push instruction (pushn) 
and sequential pop instruction (popn) when a dedicated stack pointer register is used as the SP 14, but it shoukJ be 
obvious to those skilled in the art that the present invention can also be applied to a structure in which any general-pur- 
pose register is used as the stack pointer. 
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Embodiment 2 

A block diagram of the hardware of a microcomputer in accordance with this embodiment is shown in Fig. 23. 

This miaocomputer 2 is a 32-brt microcontroller that comprises a CPU 10; ROM 52; RAM 50; a high-frequency 
5 oscillation drcuit 910; a low-frequency oscillation drcuit 920; a reset circuit 930; a prescaler 940; a 16-bit programmable 
timer 950; a 8-bit programmable timer 960; a clock timer 970: an intelligent DMA 980; a high-speed DMA 990; an inter- 
rupt controller 800; a serial interlace 810; a bus control unit (BCU) 60; an A/D converter 830; a D/A converter 840; an 
input port 850; an output port 860; and an I/O port 870; as well as various buses 92, 94. 96. and 98 that are connectied 
to these components and various pins 890. 
10 The CPU 10 has an SP that is a dedicated stack pointer register, and It decodes and executes the dedicated stack 
pointer instructions as described above. This CPU 10 has the same configuration as that of the prevtously described 
Embodiment 1 and it functions as a decoding means and execution means. 

The microcomputer of this embodiment can therefore enable efficient coding and execution of processes that act 
upon the stack pointer, with short instructions. 
IS The processes of saving and restoring registers can also be coded efficiently, so that Interrupt processing and sut>- 
routine call/return can be performed rapidly. 

The microcomputer of this Invention can be applied to peripheral equipment fbr personal computers, such as print- 
ers, or electronic equipment such as portable appliances, for example. Such applications of the invention make it pos- 
sitDle to provide inexpensive, but sophisticated, electronic equipment that is facilitated by a data processing circuit that 
20 has a fast processing speed and a highly efficient usage of memory. 

Note that this Invention Is not to be taken as being limited to the above described embodiments; it can be embodied 
in various other ways. 

Claims 

25 

1. A data processing circuit comprising: 

a dedicated stack pointer register that is used only for the stack pointer; 

decoding means for decoding object code of a group of dedicated stack pointer instructions which have object 
30 code specifying said dedicated stack pointer register as an implicit operand and which relate to processing 

based on said dedicated stack pointer register, and for outputting a control signal based on said object code; 
and 

execution means for executing said group of dedicated stack pointer instructions based on said control signal 
and the contents of said dedicated stack pointer register. 

35 

2. The data processing circuit as defined in claim 1 , wherein said group of dedicated stack pointer instructions com- 
prises a load instruction having data fbr specifying a transfer register within object code; 

said decoding means decodes said load instruction; and 
40 said execution means performs at least one of a transfer of data from a given first area in mennory to a given 

first register and a transfer of data from said given first register to said given first area, based on a memory 
address specified by said dedicated stack pointer register and a register address specified by said data fbr 
specifying a transfer register, when said load instruction is executed. 

45 3.. The data processing circuit as defined in daim 2. wherein said load instruction comprise offset data within said, 
object code that is data relating to an offset fbr specifying ttie address of said first area in said memory; and 

said execution means determines said memory address according to the contents of said dedicated stack 
pointer register and said offset data. 

so 

4. The data processing circuit as defined In claim 3, wherein said offset data comprises immediate offset data and 
data size information relating to the size of given data in menx)ry; and 

said execution means aeates at i offset by performing a leftward logical shift on said immediate offset data, 
55 based on said immediate offset data and said data size information, and determines saki memory address by 

adding said offset to the contents of saki dedicated stack pointer register. 

5. The data processing circuit as defined in any one of claims 1 to 4, 



21 



EP0 809 180 A2 

wherein said group of dedicated stack pointer instructions comprises a stack pointer niove instruction for hav- 
ing move data within the object code thereof and for moving said stack pointer; 
said decoding means decodes said stack pointer move instruction; and 

said execution means modifies the contents of said dedicated stack pointer register based on said move data 
5 when sati stack pointer move instruction is executed. 

6. The data processing circuit as defined in daim 5. wherein said move data comprises immediate data for said stuck 
pointer move instruction; and 

10 said instruction execution means performs at least one of a process of adding said immediate data for saki 

stuck pointer move instruction and the contents of sakJ dedicated stack pointer register and a process of sub- 
tracting said immediate data for said stuck pointer move instruction from the contents of said dedicated stack 
pointer register. 

IS 7. TTie data processing circUt as defined in any one of claims 1 to 6, further comprising: 
a plurality of registers provided in a contiguous sequence; 

wherein said group of dedicated stack pointer instructions comprises at least one of a sequential push instruc- 
tion and a sequential pop instruction having data for specifying a plurality of registers in the object code tiiereof ; 
said decoding means decodes at least one of said sequential push instructbn and said sequential pop instruc- 
tion: and 

said instruction execution means performs at least one of a process of executing a plurality of sequential 
pushes of data from said plurality of registers to a stack provided in memory and a process of executing a plu- 
rality of sequential pops of data from said stack to said plurality of registers, based on the contents of a memory 
address specified by said dedicated stack pointer register and said data for specifying a plurality of registers, 
during the execution of at least one of said sequential push instruction and said sequential pop instruction. 

8. The data processing circuit as defined In daim 7, further comprising: 

30 n general-purpose registers specified by register numbers 0 to n-1 ; 

wherein object code of at least one of sakJ sequential push instruction and sad sequential pop instruction com- 
prises a final register number to whtoh one of sakj register numbers is specified, as said data for spedfying a 
plurality of registers; and 

said execution means performs at least one of a process of executing a plurality of sequential pushes of data 
3S into a stack provided in memory from a plurality of registers starting from register 0 to a register specified by 

said final register number and a process of executing a plurality of sequential pops of data from saki stack to a 
plurality of registers starting from regteter 0 to a register specified by said final register number, based on the 
contents of a menrx)ry address specified by said dedicated stack pointer register. 

40 9. The data processing circuit as defined in daim 7 or 8. 

wherein sakJ execution means comprises: 

write means for writing the contents of a given register that is one of said plurality of registers to a stack 
45 provided in memory, based on a memory address specified by said dedicated stack pointer register; 

number-of-times-written count means for counting the number-of -times-written of saki write means; and 
comparison means for comparing said number-of-times-wrrtten counted by saki count means witii the 
value of said data for specifying a plurality of registers; 

so wherein saki write means comprises: 

write memory address generation means for adding a first input and a second input by an adder, and for 
generating a write memory address for soecifyinq a write destination; 

first input control means for provkiir^ control'such that tiie first input of sad adder is the contents of saki 
ss dedicated stack pointer register at the start of execution of a sequential dedk:ated instruction, and Is a write 

address generated subsequentiy by a write address generation means; 

second input control means for oulputting an offset used during the writing of one word from said stack to 
tiie second input of saki adder; and 

write means for writing to said stack the contents of a register specified by subtraction processing which 
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uses said data for specifying a plurality of registers and said numt>er-of-timeS'written, based on said write 
memory address; 

whereby the writing of the contents of said plurality of registers to said stack and the ending of said writing 
are controlled based on the comparison result of said oonparison means. 

5 

1 0. The data processing drcuit as defined in any one of claims 7 to 9, 

wherein said instruction execution means comprises: 

10 read means for reading the contents of a stack provided in memory based on a memory address specified 

by said dedicated stack pointer register, and storing said contents in a given register of said plurality of reg- 
isters: 

number-of-times-read count means for counting the number-of -times-read of said read means; and 
comparison means for comparing said number-of-times-read counted by said count means with the value 
75 of said data for specifying a plurality of registers; 

wherein said read means comprises: 

write memory address generation means for adding a first input and a second input by an adder, and for 
generating a write memory address for specifying a write destination; 

first input control means for providing control such that the first input of said adder is the contents of said 
dedicated stack pointer register at the start of execution of a sequential dedicated Instruction, and is a read 
address generated subsequently by a read address generation means; 

second input control means for outputting an offset used during the writing of one word from said stack to 
the second input of said adder; 

read means for reading the contents of said stack based on said read memory address and storing said 
contents in a register specified based on said number-of-tlmes-written; 

whereby the reading of the contents of said stack and the ending of said reading are controlled based on 
the comparison result of said comparison means. 

11. The data processing circuit as defined in any one of claims 1 to 10; further comprising: 

a program counter register used only for the program counter; 

wherein said group of dedicated stack pointer instructions comprises branch instructtons that are an instruction 
35 for branching to a subroutine arxi a return instruction from said sukmutine; 

said decoding means decodes said branch instructions; 
said instructton execution means comprises: 

means for executing at least one of a process of saving the contents of said program counter register to a 
40 given second area of said stack provided in memory and a process of restoring the contents of said sec- 

ond area to said program counter register, based on a memory address specified by said dedicated stack 
pointer register, during the execution of said branch instruction; and 

means for updating the contents of said dedicated stack pointer register based on said saving and resto- 
ration. 

45 

1 2. A data processing circuit comprising a plurality of registers provided in a contiguous sequence and a stack pointer 
allocated to one of a plurality of general-purpose registers, said data processing circuit further comprise: 

means for decoding object code of an instruction that is at least one of a sequential push instruction and 
so sequential pop Instruction that each have data for specifying a plurality of registers witiiin object code and for 

outputting a conti'ol signal on the basis of said object code; and 

means for performing at least one of a process of executing a plurality of sequential pushes of data from said 
plurality of registers to a stack provided in memory and a process of executing a plurality of sequential pops of 
data from said stack to said plurairty or registers.' t>ased on said control signal, the contents of a memory 
55 address specified by said dedicated stack pointer register, and said data for specifying a plurality of registers, 

during the execution of at least one of said sequential push instruction and said sequential pop instruction. 

13. The data processing circuit as defined in any one of claims 1 to 12. wherein said circuit uses instructions of a 
reduced instruction set computer. 
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14. The data processing circuit as defined in any one of claims 1 to 13, wherein: 

a fixed-length instruction is decoded and execution Is based on said fixed-length Instruction. 

5 15. A microcomputer comprising the data processing circuit as defined In any one of claims 1 to 14, storage means, 
and Input and output means for inputting data from and for outputting data to external devices. 

1 6. The miaocomputer as defined in daim 1 5, 

10 wherein a program that is executed thereby uses a program language that secures a storage region for auto- 

variables by using said stack pointer. 

17. Electronic equipment comprising the microcomputer defined In claim 15 or 16. 
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FIG. IB 
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FIG. 3 
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FIG. 5 
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FIG. 6A 



SP- relative load instruction 



612 

_L_ 



Op code 



6 bits 



614 



Immedlale 
offset datq 



'6 bits 



616 



Register 
number 



4 bits 



610 



F/G. SB 

General -purpose lood instruction 



620 



622 

\ 


624 

\ 


626 
\ 


628 ^ 

\ 


Op code 


Immediate 


First 


Second 


offset data 


register 


register 




number 


number 



6 bits 



'6 bits 



4 bits 



4 bits 



31 



EP 0 809 180 A2 



FIG. 7 
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FIG. 8 
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FIG, II 
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FIG. 12 
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FIG. 13 
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FIG. 14 
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FIG. 15 
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FIG. 16 
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FIG. I7A 
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FIG, 19 
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Flowchart of pop execution 




offset«4 
countx= 0 

SP — XA-BUS 
SP-*-D.ADDR.BUS| 



SIIO 



Add_LT-^XA_ 


BUS+ offset 






do read bus cycle 

D - DATA - BUS RF { count x ) 


<^countx 


' SI 13 
Y 



Sill 



.SI 12 



Add.LT-^WW-BUS 
WW_BUS-^SP 



SI 15. 



countx-^countx + I 
Add_LT-^XA_BUS 
Add- LT-^D_ADDR-BUS 



6 



47 



EP0809180A2 



FIG. 23 
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